Neural network based integration of multiple confidence measures for OOV detection
نویسندگان
چکیده
In this paper we present a novel method to reject OOV words for speaker dependent dynamic command set recognition. The OOV rejection problem is regarded as the designing of recognizer with two classes: In-Vocabulary command and OOV command. Multiple soundly confidence measures derived from likelihood score of acoustic match and prosody match are defined and compete with each other at the same level automatically within neural network framework, thus elude choosing balanced sensitive threshold like traditional strategy. The network weights are trained according to Minimum Misclassification Error criterion. The confidence measures take whole command set into account, and objectively describe the difference between the top one and alternative hypotheses. Experimental results show that neural network based combination is rational, reliable and stable with average total error rates 9.3%, outperforming any single confidence measure threshold approach. Also the across verification results show that trained network is independent of speaker, gender and command set. Although there is performance degradation when exported to another conditions, it is acceptable in many applications.
منابع مشابه
Out-of-Vocabulary Spoken Term Detection
Spoken term detection (STD) is a fundamental task for multimedia information retrieval. A major challenge faced by an STD system is the serious performance reduction when detecting out-of-vocabulary (OOV) terms. The difficulties arise not only from the absence of pronunciations for such terms in the system dictionaries, but from intrinsic uncertainty in pronunciations, significant diversity in ...
متن کاملIdentification of Multiple Input-multiple Output Non-linear System Cement Rotary Kiln using Stochastic Gradient-based Rough-neural Network
Because of the existing interactions among the variables of a multiple input-multiple output (MIMO) nonlinear system, its identification is a difficult task, particularly in the presence of uncertainties. Cement rotary kiln (CRK) is a MIMO nonlinear system in the cement factory with a complicated mechanism and uncertain disturbances. The identification of CRK is very important for different pur...
متن کاملAnalysis and Diagnosis of Partial Discharge of Power Capacitors Using Extension Neural Network Algorithm and Synchronous Detection Based Chaos Theory
Power capacitors are important equipment of the power systems that are being operated in high voltage levels at high temperatures for long periods. As time goes on, their insulation fracture rate increases, and partial discharge is the most important cause of their fracture. Therefore, fast and accurate methods have great importance to accurately diagnosis the partial discharge. Conventional me...
متن کاملSpoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملDaily Pan Evaporation Estimation Using Artificial Neural Network-based Models
Accurate estimation of evaporation is important for design, planning and operation of water systems. In arid zones where water resources are scarce, the estimation of this loss becomes more interesting in the planning and management of irrigation practices. This paper investigates the ability of artificial neural networks (ANNs) technique to improve the accuracy of daily evaporation estimation....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000